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Abstract 

An (encoded) decision problem over S is a pair (£', F) where 
£'=words that encode instances of the problem, i^=words to be ac- 
cepted. We use strings in a technical sense, borrowed from Com- 
putability. With any NP problem (E, F) we associate a set of strings 
\LogE{F)\ called the reduced logogram of F relative to E, which con- 
veys structural information on E, F, and how F is embedded in E. 
We define notions of internal independence of decision problems in 
terms of \LogE{F)\. The kernel Ker{P) of a program P that solves 
{E,F) is the set of those strings in \LogE{F)\ that are actually used 
by P in making decisions. There are strict relations between Ker{P) 
and the complexity of P. 

We develop an application to SAT that relies upon a property of 
strong internal independence of SAT. We show that SAT cannot have 
in its reduced logogram certain strings that, when present, serve as 
collective certificates. As a consequence, all the programs that solve 
SAT have the same kernel Ker{P) = \LogcNF{SAT)\. 

1 Introduction 

We develop an application to SAT, to be positioned in current stream of 
interest in the structure of Boolean satisfiability [1]. We use strings in a 
technical sense, borrowed from Computability [2]. 

An encoded decision problem over S is a pair [E, F) where i?=words 
that encode instances of the problem, F=words to be accepted. On input x, 
a decision program P for {E, F) either accepts x (if x is in F) or rejects x 
(if X is in E — F) or else discards x (for x outside E) . 

Our fundamental construct is a set of strings LogE{F) called logogram of 
F relative to E that conveys structural information on E, F, and how F is 
embedded in E. We mostly use the reduced version \LogE{F)\, consisting of 
those strings in LogE{F) that do not include other strings in LogE{F). The 
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kernel Ker{P) of a program P that solves {E, F) is the set of those strings 
in \LogE{F)\ that are actually used by P in making decisions. There are 
strict relationships between the composition in terms of strings of the kernel 
Ker{P) of a program solving {E, F) and the complexity of P. 

Our application to SAT uses a property of internal independence of a 
decision problem that we call "strong internal independence." Think of a 
computation in which the result of any computation step does not change 
the results that are possible for subsequent steps. Internal independence is 
defined in terms of a relation of entanglement between sets of strings 
relative to reference set E. 

Our main results are the following. We show that {CNF, SAT) exhibits 
the strong internal independence property: Intuitively, no "entanglement at 
distance" between strings in \LogcNF{SAT)\ is possible. Besides, we show 
that problem {CNF, SAT) cannot have, in its reduced logogram, certain 
collective certificates that we call wizards. As a consequence, the decision 
programs P that solve {CNF, SAT) all have the same kernel Ker{P) = 
\LogcNF{SAT)\. 

2 Certificates of Membership as Strings 

We first recall notions regarding the certificates of membership in NP theory. 
As next step, we illustrate possible use of strings to represent certificates. We 
conclude the section reviewing basic algebraic properties of strings. 

Let G C E* X E* so that G is a relation on words over E. Let Dom{G) 
and Cod{G) be first and second projection of G. A relation G which is both 
polynomial-time decidable and polynomially balanced is an NP relation. L is 
in NP if and only if there exists an NP relation C such that L = Dom{C). We 
interchange problems with languages: {E, F) e NP and F e NP amount to 
the same. 

Let {E,F) be an NP problem. Then there exists a sequence yi, y2,.. of 
words (over some appropriate alphabet) called solutions or else certificates 
of membership for problem {E,F). For any problem instance x & E we 
have that x can possibly be satisfied by some of the yiS. We also have 
"unsatisfiable" instances. What "satisfaction" means operationally is proper 
of problem {E, F). 

Cardinality function a{n) of an NP problem: We may arrange notations 
so that all solutions that can possibly satisfy an x of size n are between yi 
and ya{n)- 

Associated with solutions yi, y2,-. there is a decomposition of target set 
F into subsets Fj called regions, where Fj is the set of those x's that are 
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satisfied by Regions satisfy tlie obvious relation F = UjFj. 

2.1 Generalized Certificates 

In tliis paper we replace certificates with generalized certificates. These are 
represented by strings, defined to be functions — > E with finite domain 
(A^=positive integers). In loose words, a string g being included (or sub- 
sumed) in a word x is that which remains by canceling zero or more letters 
in X, while leaving blanks in places of letters. Note that words are certain 
special strings, thus the solutions yi, 1/2,.. continue to be certificates. This 
generalization allows us to introduce certain more general certificates that 
we call wizards. 

We assume that satisfiability, being a property exhibited by certain words, 
is accompanied by characteristic signs, that we think as distinctive marks, 
or signatures, being somehow inscribed within the word x under study. A 
detailed discussion would yield strings as proper formalization of such notions 
as "mark" or "signature." Thus, we assume that signs are strings interspersed 
in X. Since strings represent words in shorthand, we call their set a logogram. 

2.2 Strings 

We define S^o to be the set of all strings over S. Look at g E as a 
prescription that a word x over S may or may not satisfy. If Dom{g) is an 
initial segment of N then g is an ordinary word: Thus, words are certain 
special strings. The length (or size) \g\ is the greatest number in Dom{g). 

Eoo is partially ordered. Given f,gG Eqo, g is an extension of /, written 
/ < as soon as Dom{f) C Dom{g) and g takes same values as / in 
Dom{f). li f < g and g < f then f = g. li f ^ g but not g < f, write 
f < g and say g is a proper extension of /, or else / is a proper restriction 
of g. The empty partial function N ^ , noted _L, is the void string, and 
Dom{l.) = 0. Any / in Sqo is an extension of _L, thus (Sqo, <) has a least 
element _L. 

Two strings / and g are compatible as soon as f{x) = g{x) for any x in 
Dom{f)nDom{g). If /, (yf are disjoint, which is to say _Dom(/)nDom(5f) = 0, 
then / and g are certainly compatible. The meet f /\g oi any pair /, g is the 
restriction of / (or g) to that portion of the intersection Dom{f) fl Dom{g) 
where / and g agree. The join of two compatible strings f,g, noted f + g, 
is the least string which is an extension of both / and g. Thus f,g<f + g 
and Dom{f + g) = Dom{f) U Dom{g). Equipped with meet and join, "£^0 is 
an upward directed complete meet-semilattice 
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3 Entanglement among Strings 



The cylinders defined below are as in Computability (the formalism is slightly 
different). The logogram is a newcomer in Computer Science. Entanglement 
is a key concept to deal with internal structure of computational problems. 

3.1 Cylinders 

Given H C Eqo we define 

Exp{H) = {x e E* : 3a e // {x > a)} (1) 

Thus, Exp{H)=set of all words that include strings from H. Call Exp{H) 
absolute expansion, equivalently, absolute cylinder associated with H. Note 
that Exp{H) is the union of the elementary cylinders Exp{g) for g E H. 

Given any recursive set of words E, we write Tj^(E) for the set of all 
strings that happen to be included in words of E, thus 

^oo(E) = e Eoo : Exp{g) n E 7^ 0} (2) 

Soo(-E') is the set of those strings g in Eqo whose associated cylinder Exp{g) 
intersects E. We think of E as the set of words over E that encode instances 
of some fixed reference computational problem 11. (Whenever we talk of a 
reference set E there is imphcit reference to some fixed abstract decision 
problem 11 as well as to a program P solving 11.) For H C Eoo(£') we write 

Expe{H) = E^^ = {x e E ■.3a e H {x>a)} = En Exp{H) (3) 

Thus, E^ is the set of those words in E that contain strings from H. 
E^ is the expansion of H relative to base E. We actually regard E^ as a 
relativized cylinder, equivalently, as being a cylinder relative to a reference 
set E. Note that for = E* we regain the absolute expansion of set H. 

Given H C T,^(E), correspondence ExpE ■ H — > E^ exhibits properties: 

£;^u£;^ = £;^^^, n = (4) 

Thus, unions and intersections of sets that are cylinders relative to refer- 
ence set E are cylinders in E. Also note that, for any H,K & T,^{E), 

H <ZK^E" <ZE^ (5) 
Expe{Expe{H)) = Expe{E") = E^ (6) 
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3.2 Logograms 

In this section we introduce the logogram of a set of words F relative to a 
reference set E. Given F <0 E, we define 

LogE{F) = {ge i:^{E) : e E{x > g ^ x e E^)} (7) 

Since ordinary words are strings, F can be regarded as a set of strings, hence 
E^ is defined. E^ is the relative cylindrification of F in E, and this in turn 
is the set of all words in E that are prefixed by words in F. 

Remark If F is a cylinder in E, i.e., F = E^ for some H C J]^{E), then 
E^ = F hj Equation [6l 

Remark For E = T,* Equation [7| gives absolute logogram. 

The main property of correspondence Logs '■ E — * Soo(-E) is the following. 
Given A,B CE, 

LogE{A UB)D LogE{A) U LogsiB). (8) 

Let us understand this inclusion. LogE{A U B) is the set of all strings that, 
for X in E, are able to trigger event x G E^^^ = E"^ U E^ . A string that 
triggers x G E"^ certainly belongs to LogE^AUB). Analogously, a string that 
triggers x G E^ certainly belongs to LogE^AUB). Thus, LogE{A)ULogE{B) 
certainly is a subset of LogE{A U B). However, there can be strings / whose 
inclusion in a word x G -E is a sufficient condition for event x G E"^ U E^ but 
not for X G E"^ or x G E^ . Thus, in the general case LogE^A U B) is not the 
same set as LogE^A) U LogE^B). 

3.3 Entanglement 

The presence of certain strings in a word may entail that of certain others. 
Given H,K (1 I]^[E), we write K H if the following happens: Every 
word in E which includes strings from K also includes strings from H. (Think 
of strings in K as spies, or else symptoms, for presence in an input string x 
of strings from H.) If H K and K H then we write H =^ K and 
say that if, K are isoexpansive relative to E. Clearly, =^ is an equivalence 
relation. It is easily seen that H =^ K if and only if E^ = E^ . 

For ii^ = E* we rewrite as □ and =^ as =. Note that / □'^ g if and 
only if every word x (within E) which includes / also includes g. 

We mention a few easy facts, (i) If f < g then / g for any possible 
E. (ii) In the general case / g does not imply f < g. (It is well possible 
that this holds for specific sets E. For example, if = S* then / g if and 
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only if / < g.) (iii) If f,g are incompatible, then it cannot be that / g. 
(iv) Given any H,K C T,oo{E), 

HCK^H^K^H^^K (9) 

We ask: Is there any easy piece of algebra linking expansion, logogram, 
entanglement? To get an answer, we define a Galois connection that will 
provide us with a closure operation in Soo(-E'), noted H . We will 

see that H and H""^ are isoexpansive relative to E. What more, there can 
be distinct subsets K,I,.. of H being isoexpansive (mod E) to while 
possibly exhibiting different computational behaviors. 

We define our connection to be a pair (a,/?) of correspondences between 
sets of strings and sets of words. The first correspondence a carries a set of 
strings H C Soo(-E) into a corresponding set of words C E. The second 
carries a set of words A <Z E into a set of strings C T,oo{E) according to 

H K ^ H'^DK'' (10) 

ACB^Af^^^Bf^ (11) 
H H""^, A C A^'^ (12) 
The connection is formally defined through the explicit expressions: 

H'^ = E" (13) 

A^ = LogEiA) (14) 

We emphasize that A is any subset of E. Thus, given any subset A of the 
reference set E the function A^ = LogE{A) is defined. However, not all 
subsets A of E happen to be the conjugate set of some set H C T,oo{E). 
If that happens, we say that A is closed. Note that A closed implies E^ = A. 

Theorem 1. {a, (3) is a Galois connection. 

Proof. We must derive Equations fT0lfT2] from Equations fT3lfT^ 

(I) Let H,K <Z Eoo(-E) be given, and assume H C-^ K. 

Let g E K, and let x be any word in E such that x > g. Then x G E^ 
hence x G K"'. Since H K, there exists f E H such that x > f. Then x 
is in E^ hence x G H°' . Equation [TU] is proved. 

(II) Next, we prove Equation [Til Let A,B C E and assume A ^ B. 

We must prove that if a word x E E includes a string g from A^ then x 
also includes a string / from B^. 

Let g G LogE^A) so that g E A^ hy Equation 
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Thus, for all X E E we have x > g ^ x E E"^. But A C i?, hence 
X e E^ ^ X e E^ hy Equation 

Thus, for all X G -E we have x > g ^ x & E^ . By Equation [TH this is to 
say g G LogE{B) = . We have shown that C . Equation [TT] follows 
by virtue of Equation [9l 

(III) Next, we prove the first of Equations \n\ 

Let g G iJ"^ and let x G be any string in E such that x > g. 

We have H"" = E" hence H"^^ = LogsiE^). Thus, g G LogsiE^). 
By Equation [7| we have x G Expe{E^), and then, by virtue of Equation [6], 
X G E^. 

We conclude that there exists f & H such that x > /. 

(IV) Next, we prove the second of Equations [T21 It follows from Equations 
[HI [H that A^" = E^°9e{A)_ On other hand one has A C E^"^'^^^^ for any 
A <Z E. Indeed, A <Z E^ and E^ = E^"^'^'-^^ from definitions, taking into 
account Equation El We proved the second of Equations [121 □ 

Theorem 2. The following equations hold: 

Proof. From theory of Galois connection [3] . □ 

Theorem 3. The map H — > if"^ is a closure operation in Soo(-E), and 
A A^°' is a closure operation in E. 

Only for a closed A do we have that, for all x in x G v4 if and only if 
there exists a g E LogE^A) such that x > g. If and only if A is a closed subset 
of the reference set E we define the reduced kernel \LogE{A)\. Regarding the 
reduced kernel \LogE{A)\ of a closed set A C E, we explicitly note that, for 
any x in i?, x G A if and only if there is G \LogE{A)\ such that x > g. 

4 The Kernel of a Decision Program 

We begin with a few remarks on the nature of the strings that happen to 
occur in the reduced logogram \LogE{E)\. For any NP decision problem 
[E, F) we assume F to be a relative cylinder in E. (It is known that SAT 
is a cylinder [5].) The strings in \LogE{F)\ are certificates of membership 
for F relative to E: For words in E, to include one or more strings from 
\LogE{F)\ is necessary and sufficient for membership in F. In principle, we 
cannot exclude that \LogE{F) \ may contain strings that behave as collective 
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witnesses, also called wizards. (There exist problems, e.g. PRIMES, where 
\LogE{F)\ has wizards.) In that case a program P solving {E,F) might do 
calculations that are functionally equivalent to testing input x for wizards. 

Let P solve problem {E,F). The computations that P performs are 
functionally equivalent to sequences of tests done on input x. This is part of 
Scott's view of computations [6] (The term "test" is ours: Dana Scott 
uses "token" or else "piece of information" according to context.) Note that 
Scott's theory is consistent with our developments as soon as we identify 
Scott's tokens with strings. In this view what P actually does is searching 
the input x for strings in \LogE{F)\. That yields a view of computations as 
sequences of tests in disguise. 

Let program P solve problem {E,F). The tests in \LogE{F)\ are those 
that P can use: They are so to speak at disposal for a program P. Which of 
these tests are actually used by P is a different story. We define the kernel of 
program P, noted Ker{P), to be the set of the strings from \LogE{F)\ that 
P actually uses for making decisions. The strings in Ker{P) are uniquely 
identified by the algorithm that P implements. The composition of Ker{P) 
in terms of strings can also be determined through experiments with the 
executable of P. 

A concept of great relevance for sequel is that of a complete subset of 
the reduced logogram \LogE{F) \ of decision problem {E, F): We define a set 
H C \LogE{F) \ to be complete for problem {E, F) as soon as, for any x E E, 
one has x e F <^ 3f e H{f < x). 

The proofs of following two theorems are not difficult and are omitted. 

Theorem 4. A necessary condition for P to correctly solve {E, F) is Ker{P) 
complete for {E, F). 

Let H C \LogE{F)\ be a complete set of strings for {E,F). We define H 
to be irreducible for {E, F) as soon as no proper subset K <Z H happens to 
be complete for {E,F). 

Theorem 5. Let \LogE{F)\ be irreducible and programs P,Q both solve 
{E,F). Then Ker{P) = Ker{Q). 

5 Independence of Decision Problems 

We first introduce a notion of pairwise independence of strings relative to a 
reference set E. As next step, we define a notion of internal independence of 
set E. Next we define notions of internal and strong internal independence 
of a decision problem. 
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Mutual Independence of Strings Let /, g be any two strings in 11^[E) 
where E is any infinite recursive set of words over alphabet E. According to 
definitions, / entangles g relative to E as soon as, for all x e x > / =^ 
X > g. We agreed that / □■^ g means that / entangles g relative to E. 

Observe that / fails to entangle g relative to E if and only if there ex- 
ists X E E such that x contains / and does not contain g. If / 2^ 9 and 
g 2^ / then / and g are said mutually independent relative to E; f and g 
are mutually dependent relative to E when they fail to be mutually indepen- 
dent relative to E. If /, g are incompatible, then certainly /, g are mutually 
independent relative to any E. 

Independence of a Recursive Set Our next step is to define the internal 
independence of a recursive set E. We define E to be internally independent 
as soon as, given any f,gE T,oo{E) one has / g if and only if / is part of 
g, that is to say, if and only if f < g. 

Independence of a Decision Problem Now we arc ready to introduce 
the simple internal independence of a decision problem {E, F). We caU {E, F) 
internally independent as soon as the strings in \LogE{F)\ are mutually 
independent taken two by two. 

Theorem 6. IfE is internally independent then any decision problem {E, F) 
based on E as reference set exhibits the simple internal independence property. 

Proof. Let E be any infinite recursive set exhibiting the internal indepen- 
dence property. Let {E, F) be any decision problem based on E as reference 
set. Let f,g be any two strings in the reduced logogram \LogE{F)\ of the 
problem. 

(I) Assume f,g incompatible. Since / G HooiE), we have Er\Exp{f) ^ 0. 
Let X E E n Exp{f). Then x is in E, x includes / and does not include g. 
Analogously, one can find a. y E E which includes g and does not include /. 
Thus, /, g are mutually independent in E. 

(II) Assume /, g compatible. By the minimality property of the reduced 
logogram \LogE{F)\ it cannot be that f > g. By the internal independence 
of the reference set E one has / g if and only ii f > g. Then, it also 
cannot be the case that / g. As a consequence, there exists x E E which 
includes / and does not include g. Analogously, there exists y E E which 
includes g and does not include /. Thus, again we have that /, g are mutually 
independent. 

We conclude that problem [E, F) exhibits the simple internal indepen- 
dence property. □ 
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Strong Independence of a Decision Problem Let us now come to the 
strong internal independence of a decision problem. We know that {E, F) 
is internally independent as soon as the strings in its reduced logogram 
\LogE{F)\ are mutually independent taken two by two. 

The simple internal independence of a decision problem {E, F) certainly 
is a form of internal independence of a decision problem, but we may indeed 
ask for more independence: We may ask for independence of the elements of 
the reduced kernel \LogE{F)\ taken m by m all m. The following notion of 
internal independence of a decision problem captures this extreme form of 
internal independence of a problem. 

We shall say that the decision problem [E, F) exhibits the property of 
strong internal independence if, for any choice of s distinct strings fi, ■■, fs 
in \LogE{F)\, the following is true: For every i between 1 and s there exists a 
word Xi & E such that Xi contains fi and fails to contain any of the remaining 
strings in {/i, .., It is left for the reader to show that strong internal 
independence of a decision problem implies simple internal independence. 

6 Witnesses and Wizards 

From Equation [8] we have 

LogEAi U .. U LogEAm C LogE{Ai U .. U Am) (17) 
for closed Ai, .., ^ E. Now replace m with a{n) and Ai with F^ : 

LogEFi U .. U LogEF^i^^) C LogE{Fi U .. U F„(„)) (18) 
The strings in LogEFi, LogEFa[n) are witnesses. The possible strings in 

LogE^Fi U .. U F,(„)) - LogEFi U .. U LogEF^^^) (19) 

we call "wizards" since they are so to speak able to perceive that an input x 
shall be in someone of the FiS but couldn't say which. The possible existence 
of this type of strings in the reduced logogram \LogE{F) \ of a decision prob- 
lem {E, F) can be demonstrated by examples. Wizards have been found to 
exist in the reduced logograms of following problems (i) To decide if a sym- 
metric loopfree graph is connected, (ii) To decide if a given positive integer 
is composite (note, incidentally, that PRIMES is in P [8J). 

In a situation in which the target set F is decomposed according to 
U„F„ = F, the witnesses are always there in the reduced logogram of set 
F relative to E. On the contrary, the wizards may be missing. It pertains to 
the structure of the computational problem at hand whether the target set 
F has wizards. We conclude this section proving a theorem: 
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Theorem 7. IfF = U„F„ where the FiS are cylinders in E, then U^" |Lo(7£;Fj 
is complete for {E, F). 

Proof. Being a union of cylinders in E, F is a cylinder in E. Being cylinders 
in E, the FiS are endowed with reduced logograms. This is to say that, 
for i = l,..,a{n) and any x E E, one has a; G -Fj if and only if there is 
g e \LogE{Fi)\ such that x > g. Since the target set U„F„ = F is itself a 
cylinder in E, Equation [TSl holds. 

(I) Let / G \LogEFi\ U .. U \LogEFo,(^n)\ and let x be an input word of 
length n such that x E E and x > f. We must prove x G -F. 

Very obviously we have / G LogEFi U .. U LogEFa[n)- 

Since sequence Fi has a cardinality function a{n), then, for input words 
X E E oi length n, equation U„F„ = F can be rewritten F = Fi U .. U Fq,(„). 

Given Fi U .. U Fq,(„) = F, f E LogE{F) follows from Equation [181 Then 
X G F follows from x > f (taking into account that F is a cylinder in F). 

(II) Let a; G F be any input word of length |x| = n. Assume x G F. We 
must prove that there exists / G \LogEFi \ U .. U \LogEFa(n)\ such that x > f . 

Since x G F and F = Fi U .. U Fa(„), there exists 1 < i < a(n), such 
that x E Fi. 

Since Fj is a cylinder in F, the reduced logogram \LogE{Fi)\ exists. This 
implies that, if ?/ G F includes a string g G |Lo(7£;(Fj)| then certainly y G Fj. 
Conversely, if ?/ G Fj then ?/ includes at least a string G \LogE{Fi)\. But 
this is just to say that \LogE{Fi)\ is a complete subset of |Lo(7£;(F)| for Fj 
relative to F, which is to say, for problem (F, Fj). 

Since \LogE{Fi)\ is complete for Fj relative to F, it follows from x G Fj 
that there exists a string / G |Lo5f£;(Fj)| such that x > /. 

Then we also have / G |Lo(7£;Fi| U .. U |Lo(7£;Fq,(„) |. 

We have shown that, given any input word x G F such that |x| = n, one 
has X G F if and only if there exists a string / in \LogEFi\ U .. U \LogEFa(^n)\ 
such that / < X. Thus, iLo^fEFil U .. U \LogEFa{n)\ is a complete subset of 
\LogEF\ for F relative to F. □ 

7 Application to Boolean Formulas 

The encoding scheme that we adopt converts CNF formulas into words over 
E = {0, 1, 2}. In what follows F = CNF, F = SAT. 

We represent clauses over Xi, .., x„ by sequences of n codes from S. Code 
denotes absence of the variable, code 1 presence without minus, code 2 
presence with minus. E.g., clause Xi V X3 V — X4 becomes 1012. 

A whole formula is encoded as a sequence of clauses. We define F"™' = 
satisfiable formulas with n variables and m clauses. 



11 



We introduce the sequence 1/1,1/2, ■■ of solutions, and the corresponding 
sequence Fi,F2, .. or recursive subsets of F. Here the solutions Ui are value 
assignments. The cardinality function is a{n) = 2". We assume that F = 
SAT as well as the regions Fi,F2,.. are closed sets in E = CNF. Thus, 
all these sets are assumed to be relative cylinders in E. These assumptions 
correspond to known properties of SAT [U]. 

Essentially, our application consists in investigating whether \LogE{F)\ 
might possibly contain strings not already in some of the \LogE{Fi)\. Before 
we discuss the propositions that we were able to derive, let us spend a few 
words on the logogram of SAT. A string in \LogE{F"'"^)\ is a prescription 
that a word in F"-^ may or may not be conformant with. We may represent 
a string in \LogE{F"'"^)\ as a word of length nm over {b} U S. Example for 
n = m = 3: String bbllb2b2b prescribes that first clause shall include X3, 
second shall include xi and — X3, third shall include —X2. Note that strings 
in \LogE{F'^"^)\ only prescribe either 1 or 2 as values (by the minimality 
property of reduced logogram). 

Theorem 8. Problem {CNF, SAT) exhibits the strong internal independence 
property. 

Proof. We consider s distinct strings /i, .., fs in |Lo(7£;(F"™')|. Thus, regarded 
as a partial function, each /, will assign only values 1 or 2. We must prove 
that for each i = 1, ..,s there exists a string xi G E^"^ = CNF'"-"^ such that 
Xi includes and does not include any of the remaining strings fi, .., fs- 

Let i be any one of the indices 1, ..,s. Then Dom{fi) C {1, ..,nm} and, 
for all h G Dom{fi), we either have fi{h) = 1 or fi{h) = 2. 

Let Xi be that word of length nm over S = {0,1,2} such that for all 
h G Dom{fi) it holds that Xih = fi{h) while for h not in Dom{fi) one has 
Xih = 0. Then certainly Xi includes fi. 

Let fj be any one of the strings fi,..,fs being different from fi. Thus, 
fj fi- We must prove that Xi does not include fj. 

(I) Assume Dom{fj) = Dom{fi). 

Since fi and fj are different, there is G Dom{fi) such that fi{k) 7^ fj{k)- 
But Xik = fi{k), then Xik 7^ fj{k). Then Xj does not include fj. 

(II) Let Assume Dom{fj) 7^ Dom{fi). 

Then either there is a G Dom{fj) such that a ^ Dom{fi) or there exists 
b G Dom{fi) such that b ^ Dom{fj). 

Assume that a exists. Then Xi does not include fj since Xia = while 
fj{a) ^ 0, hence Xia 7^ /}(a.)- Analogously, Xi does not include fj in case b 
exists. □ 

Theorem 9. The reduced logogram \LogcNF{SAT)\ does not contain wiz- 
ards. 
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Proof. We must prove: 

LogEFr U .. U LoqeF^^) = LogE{F^n (20) 

where a{n) is the cardinahty function of sequence Fi,F2, ... Here Fi is the 
range of the value assignment yi (set of formulas in F that are satisfied by yi) 
and is a cylinder in E. Since Equation [18] holds, we just have to prove that 
the right-hand side of Equation [20] does not contain wizards. We actually 
will prove: 

\LogEFr\ U .. U \LogEF^i:)\ = |Lo(7s(F"™)| (21) 

which is evidently equivalent to Equation [201 

We write K^"^ for \LogE{F"'^)\ and, for every integer i = 1, ..,a{n), we 
write = \LogE{FJ'"')\. We must prove iT"™ = U .. U K^'^^y 

First of all, note that the set of all witnesses K^"^ U .. U K^^^ is complete 
for the target set F = SAT relative to reference set E = CNF by Theorem [7] 
This implies that, if x G F""^, then x includes a string / e K"™- U .. U K^^y 

Let h e i^"™. Since /sT"'" is included in T.^[E), we have h G ^oo{E). 
Then there is an x G E"^™- such that x > h. On the other side, if x is in E"-"^ 
and includes string h, then x E F, hence, since K^"^ U .. U -R'q^JJ) is complete 
for F relative to E, there shall exist a string k G fCJ*™ U .. U K^}^^ such that 
X > k (and /i, shall have to be compatible to one another). 

We then set h —>■ k to mean that (i) A; is a member of K^'^ U .. U K^J^y 
(ii) there exists x G E""^ such that both h < x and k < x. (Thus, h ^ k 
implies that h and k are compatible.) 

Besides, we introduce the set U{h) = {k\h — ^ k} of those witnesses 
(members of set K^"^ U .. U K^J^^) that are related to h. 

Now, by way of contradiction, we assume that h does not belong to U (h). 

We then have that the elements in the set {h} U U{h) are all distinct. 

By the strong internal independence of SAT, in correspondence to each 
string / G {h} U U{h) there exists a word x G E"'^ such that f < x and for 
no (7 G {h} U U{h) being distinct from / one has g < x. 

Let X G E"''^ be such that x > h and for no g & U{h) one has the 
inclusion g < x. Word x is in F = SAT since x includes h which is an 
element of \LogE{F^"^)\. Besides, x does not contain any element from U{h). 
But that in turn means that x does not contain any strings from the witset 
U .. U K2}^y (Should X include a string k from U .. U K^}^^ that 
would mean that both x > h,x > k would hold, hence k would be related 
with h which would imply k G U{h).) 

This is absurd, since K]^"^U..UK2}^^^ is complete for SAT relative to CNF. 
We conclude that /i is a member of U{h), and hence is in U .. U K^J^y 
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Since we already know that K^"^ U .. U K^J^^ is a subset of K"'^, we conclude 
that fsT'"™ = U .. U K^'^y Thus, SAT has no wizards. □ 

Theorem 10. The reduced logogram \LogcNF{SAT)\ is irreducible. 

Proof. Let g G \LogE{F'"'^)\- By Theorem [9] we know that g must be a 
witness. Thus, 5^ is a string conveying the specification of exactly one value 
assignment. Besides, g is minimal (no proper restriction of is a sufficient 
condition for event x E F). These two facts make it a straightforward task 
to specify the general shape that string g shall exhibit. 

First of all, Dom{g) shall have to be a set of exactly m numbers taken 
from {1, .., nm}. The first of these numbers is to be taken from the first block 
{l,..,n} (where the first clause is allocated), the second is to be from the 
second block {n + 1, ..,2n},.., the mth is from the mth block {n{m — 1) + 
l,..,nm} (where the last clause is allocated). Thus, there are nm possible 
determinations for Dom{g). We know that regarded as a prescription, g can 
only prescribe the two values 1 and 2. (To help intuition, string g can be 
thought of as a sequence of fiats bb..b of length nm in which some of the fiats 
(as many as m) have been replaced with Is or 2s.) 

With any g that satisfies the above requirements we associate a formula 
7(5') as follows. We note that, regarded as a prescription, g prescribes the 
presence of exactly one literal in each clause of a formula x consisting of m 
clauses: We then state that the ith clause of 7(5') shall consist of exactly the 
single literal that g prescribes to the ith clause of x. 

Evidently, 7(5') is satisfiable and g < 7(5'). We claim that 7(5') does not 
include members of \LogE{F"'''^)\ other than g. 

Indeed, the strings in \LogE{F"'"^)\ never prescribe as value, and g is 
the largest string being included in the codeword of 7(5') which does not 
prescribe as value. Thus, the only strings that do not prescribe as value 
and happen to be included in the codeword of 7(5') are exactly string g itself 
and the proper restrictions of string g. Since g is minimal, all of its proper 
restrictions are not members of \LogE{F'^"^)\. Thus g is the only string being 
included in the codeword of 7(5') to be found in \LogE{F"-"^)\. 

Hence, iLogEiF"-^)] — {g} is not complete for F"™ relative to E"™. □ 

8 SAT as Search Problem 

The search version of a decision problem consists in obtaining solutions for 
a given instance x. Thus, with any NP problem {E, F) we associate the 
following search problem: Given x find a solution y for x or state that no 
such y exists. 
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It is known that, by self-reducibility of SAT, if we had a polynomial 
algorithm for SAT, then we would also have a polynomial algorithm for the 
search problem associated with SAT [9]. The results of previous sections 
show that we can say more: It is impossible to solve SAT without at the 
same time solving the search problem associated with SAT. 

These remarks suggest that we may wish to focus on the search problem 
associated with SAT. This is what we do in this section. 

Given any NP problem {E,F), we introduce the cover of the target set 
F associated with \LogE{F)\ to be the family of sets 



Its members are the charts or else regions of the cover. The cover that is 
associated with the kernel of a program P solving [E, F) is then 



Both Ve{F) and J^p{E, F) are families of subsets of the target set F whose 
union is F, with Tp{E, F) being a subfamily of Ve{F). 

For SAT we have the following situation: Tp{E,F) = Ve{F) by Theo- 
rem [TU] and the strings in \LogE{F)\ are all witnesses by Theorem O Thus 
any of these strings, call it g, has an associated relativized cylinder ExpE^g) 
being fully included in only one of the regions FiS. 

Since for E = CNF, F = SAT, ExpE{g) is actually an intersection of 
two absolute cylinder sets Exp{g) and E, then ExpE^g) itself is an absolute 
cylinder. In general, ExpE{g) will intersect certain other regions F^, F^,.. 
, but there exists only one region Fj which completely includes ExpE{g)- 
Besides, every region Fi shall have to include at least one such elementary 
relativized cylinder ExpE^g)- 

As a consequence, the cardinality of the cover Ve{F) cannot be smaller 
than that of the family of sets {F" : i = 1, .., 2"}, hence it is exponential. 

Remarks on the Time Complexity of SAT In the rest of this section 
we make remarks on the time complexity of SAT in the light of Theorems 8, 
9, 10. We will be less formal than in previous sections. Our remarks consist 
of two parts: 

Part One It follows from Theorem [TO] that there is a unique subfamily 
of Ve{F) such that F = IJ-^, namely Ve{F) itself. As a consequence, for 
any proper subset JF c Ve{F) one has F 7^ IJJF. 

We then have that it cannot be that Tp{E,F) is a proper subfamily of 
the full cover Ve{F), otherwise we would have F 7^ IJ jFp(F, F), and then P 



Ve{F) = {ExpEig) ^F:ge |Lo^e(F)|}. 



(22) 



Tp{E,F) = {ExpEig) '^F-.ge Ker{P)}. 



(23) 
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could not be correct as a program. In particular, since T>e{F) is exponential, 
J^p{E, F) is not allowed to be a polynomial subfamily of Ve{F) ): No search 
algorithm for SAT can only search a polynomial family of sets. 

Part Two It remains for us to discuss the possibility that one single algo- 
rithm can solve the full search problem for x by directly searching the full 
exponential family T>e{F) in polynomial time. However this can scarcely be 
the case due to complete absence of any form of dependence among subsets 
in the reduced logogram \LogE{F) \ for E = CNF, F = SAT. By this lack of 
internal dependence, any computation of a program P solving {CNF, SAT) 
is such that the result of any computation step does not change the results 
that are left possible for the subsequent steps. In the rest of this part we 
make a few informal remarks on how this lack of dependence comes into play. 

We take a general purpose program machine M as computation model. 
(That M is a program machine means that the process carried out by M 
is determined by a running program.) We assume that only one program 
is running at any moment of time within M. We keep machine M fixed 
while we consider an infinite set of programs solving SAT (actually the set 
of all programs that run on M and solve SAT). We emphasize that the 
hardware is kept fixed while different programs all running on that hardware 
are compared. 

Let B{x, m) be a program which for any given input x of size n and every 
integer m between 1 and 2" will decide if x has solutions in the range between 
yi and i/m- Take TimeB{x,m) be the number of time units that B uses on 
inputs X, m. 

We will make remarks that convey evidence for following statement: If 
for any x and m < 2" we have TimeB{x,m) = TimeB{x,m + 1), then 
we may replace B with a new program C running on M and such that 
Timec{x, m) < Timec{x, m + 1) = TimeB{x, m + 1). 

Indeed, under the above hypotheses on M, we can speak of the the class 
of all programs B, C,.. that solve SAT on machine M, and we can introduce 
a most efficient program A in this class. We understand that A is a most 
efficient program as soon as TimeA{x,2^) < Timec{x,2^) for any other 
program C on any input word x. 

It is sufficient for us to give a hint for TimeA^x, 1) < TimeA^x, 2). 

Our hint is the following. Since, by TheoremfHl we have \LogE{FiUF2) \ = 
\LogE{Fi)\U\LogE{F2) \ and \LogE{Fi)\r\\LogE{F2)\ = 0, a computation that 
implements the collection of tests in \LogE{Fi U ^2)! consists of two distinct 
computations, one implementing collection \LogE{Fi)\ and the other imple- 
menting collection \LogE{F2)\. Thus, computation A{x,l) being a proper 
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prefix of computation A{x, 2) is compatible witli assumed optimality of A, 
wlience TimeA{x, 1) < TimeAix, 2). 

9 On Ascribing Knowledge to Programs 

Our tlieory has roots in tlie body of formalized concepts referred to as Scott's 
theory of computation [TU]. Thus, the reduced logogram \LogE{F)\ associ- 
ated with problem {E, F) is an information system , ^ (however, the 
very important relation is not entailment but entanglement). Even more 
relevant are the relationships with the "dynamical" part of Scott's theory, 
the one regarding computations as sequences of steps through which the run- 
ning program's knowledge increases [3]. We also, in this latter respect, used 
concepts from the model theoretic analysis of program knowledge [llj. 

In this section we briefly review relationships of the above theory with 
formalisms that ascribe knowledge to a running program. 

In Scott's theory the computations that program P does are functionally 
equivalent to sequences of tokens (or tests) being consistent with the input 
string X. In our developments, the "tests" or "tokens" are identified with 
the strings in Ker{P). In Scott's theory, the state of knowledge of a running 
program P consists of a pile of assertions. These are consistent (indeed, they 
are propositions that are true of one and the same object x). As soon as the 
pile becomes a decisive one, the program makes its decision and stops. Our 
addition is: The "assertions" are of the form x G Exp{g) or else x ^ Exp{f) 
where /, G Ker{P). 

Searching x for a string g amounts to same as asking if x happens to 
belong in the absolute elementary cylinder Exp{g) associated with g. We 
thus arrive at the conclusion that all that P can possibly do to make a 
decision consists in asking questions of this form. Thus, the computations 
that P performs are just sequences of tests in disguise. Note that P has not 
got to ask whether x is in Expsig) since P already knows that x is in E. 
(This is an important point since asking if x is in Expsig) would be more 
comput at ionally exp ensi ve . ) 

In this theory, information regarding x is acquired by P in lumps. The 
acquisition of a piece of information occurs at the moment when the exe- 
cution of a sequence of tests is completed (i.e., when the computation that 
implements that sequence of tests is completed). We may well think of a 
piece of information as being a piece of paper carrying a written note such 
as "x is in Exp{g)" or "a; fails to be in Exp{g)." These notes stack one upon 
the other until the pile becomes a decisive one: This is the case when the 
data that was gathered entails one of the events x G F or else x E E — F . 
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Note that loading an input x in memory does not imply computations, 
hence no tests are made on x while loading, hence no knowledge is acquired 
about X. After loading x, the pile of assertions that represents program's 
knowledge is empty. 

10 Conclusions 

We advocated strings (with special meaning for the term) as a fundamen- 
tal notion for studies of computation. So to speak, strings are needed to 
express the notions of internal and strong internal independence of a deci- 
sion problem that underly our theory of decision problems. We were led to 
formulate strings to become able to derive the very basic notion of internal 
independence of a decision problem. Strings seem to be useful since they are 
absolutely elementary. Note that they are already at work in Computability. 
The "restrictions" that are often used in the study of circuit complexity are 
finite Boolean versions of the strings [S]. 



Strings are not made of consecutive letters. A string can be interspersed in 
a word: By canceling zero or more letters in a word x, and by leaving blanks 
in places of letters, we get a string / which is a substring of the original word 
X. In a string, one has information associated with spaces between letters 
(and hence with possible multiple periodicity with which letters may occur). 
As soon as we have the strings, we are able to define the kernel Ker{P) of 
a decision program P, a set of strings which capture structural features of 
both program P and the decision problem {E, F) that P solves. 

Ker{P) is a subset of the reduced logogram \LogE{F)\ of target set F 
in base E. The reduced logogram consists of substrings of the words in F 
which exhibit the following property: If a word in E includes one of these 
substrings then it belongs to F. We may think of the strings in \LogE{F)\ 
as kind of genes of the words in F. (In early notes the logogram was the 
jinnee or genie of problem {E,F).) The idea clearly comes from biology, 
where it is known that certain occurrences at given intervals of certain letters 
within DNA sequences convey structural information, and yield observable 
characters in the macroscopic development of the structures. 

Our application to SAT uses a structural property of that problem that 
seems to have escaped attention so far. We called it "strong internal inde- 
pendence." Theorem 8 shows that SAT exhibits the strong internal inde- 
pendence property. Theorem 9 shows that, by that property, SAT cannot 
have collective certificates in its reduced logogram. As a consequence, all the 
programs that solve SAT have same kernel (Theorem 10). 

The remarks in Section 8 suggest how Theorems 8, 9, 10 can possibly be 
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used to put SAT under scrutiny. Our ultimate concern in this paper has been 
to set forth our developments as a possible new technique to attack decision 
problems, where "technique" is here used in the sense that Hemaspaandra 
and Ogihara gave to this term in the preface of their "Companion." 
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